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I | the drawings, 


sheets/fig , as originally filed, 
sheets/fig , filed with the demand, 
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| 1 paid additional fees. 
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2. This Authority found that the requirement of unity of invention is not complied with and chose, according to Rule 
68. 1, not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13. 1, 13.2 and 13.3 is 

|x| complied with. 

| | not complied with for the following reasons: 



4. Consequently, the following parts of the international application were the subject of international preliminary examination 
in establishing this report: 

fx] all parts. 

| [ the parts relating to claims Nos. 
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1-41 
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Claims 
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Claims 


1-41 


NO 


Industrial applicability (IA) 


Claims 


1-14 
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Claims 


NONE 
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2. Citations and explanations 

Document 1 

AU 60798/90 (CALIFORNIA BIOTECHNOLOGY INC) published 21 February 1991 

Figure 6 in document 1 discloses isolated cDNA sequences encoding bVEGF^o and bVEGFi^ both these sequences show 
at least 15% similarity and 5% dissimilarity to the sequence set forth in sequence ID#2. 

Document 2 

AU 56574/90 (GENENTECH, INC) published 15 November 1990 

Figures 2 and 12 disclose amino acid sequences having at least 15% similarly and 5% dissimilarity to the sequence set 
forth in sequence ID#2. 

Document 3 

Biochemical and Biophysical Research Communication (1989), Volume 165, No. 3 page 1 198-1206 (Edmund Tischer et 
al.) "VASCULAR ENDOTHELIAL GROWTH FACTOR: A NEW MEMBER OF THE PLATELET-DERIVED 
GROWTH FACTOR GENE FAMILY" See whole Article. 

Sequence of figure 3 in document 3 has at least 15% similarity and 5% dissimilarity to the sequence set forth in sequence 
ID#2. 

Document 4 

Journal of Virology (1994), Volume 68, No. 1 page 84-92 (David J Lyttle et al.) "HOMOLOGS OF VASCULAR 
ENDOTHELIAL GROWTH FACTOR ARE ENCODED BY THE POX VIRUS ORF VIRUS". See whole Article. 

Figure 5 of document 4 discloses Orf virus NZ7 and NZ2 genes encoding polypeptides which are at least 15% similar and 
5% dissimilar to the sequence set forth in sequence ID#2. 
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Document 5 

Methods in Enzymology (1991), Volume 198, page 391-405 (Ferrara Napoliana et al.) "PURIFICATION AND 
CLONING OF VASCULAR ENDOTHELIAL GROWTH FACTOR SECRETED BY PITUITARY 
FOLLICULOSTELLATE CELLS". See whole Article. 

Figure 1 in document 5 discloses bovine VEGF cDNA clone encoding amino acid sequence that is at least 15% similar 
and 5% dissimilar to the sequence set forth in sequence ID#2. 



The Journal of Biological Chemistry (1991), Volume 266, No. 18 page 1 1947-1 1954 (Edmund Tischer et al.,) "THE 
HUMAN GENE FOR VASCULAR ENDOTHELIAL GROWTH FACTOR". See whole Article. 

Figure 2 of document 6 discloses PCR products PCR-1 and PCR-3 having sequences that are at least 15% similar and 5% 
dissimilar to the sequence set forth in sequence ID#2. 



DEVELOPMENT (1992), Volume 1 14, page 521-532 (Breier G et al.,) "EXPRESSION OF VASCULAR 
ENDOTHELIAL GROWTH FACTOR DURING EMBRYONIC ANGIOGENESIS AND ENDOTHELIAL CELL 
DIFFERENTIATION". See whole Article. 

Figure 3 of document 7 discloses mouse, bovine and rat VEGF protein sequences that are at least 15% similar and 5% 
dissimilar to the sequence set forth in sequence ID#2. 



Molecular Endocrinology (1991), Volume 5 No. 12, page 1806-1814 (Houck Keith A et al.,) "THE VASCULAR 
ENDOTHELIAL GROWTH FACTOR FAMILY: IDENTIFICATION OF A FOURTH MOLECULAR SPECIES 
AND CHARACTERISATION OF ALTERNATIVE SPLICING OF RNA" See whole Article. 

Figure 2 in document 8 discloses comparison between multiple VEGF molecular species VEGF 121, VEGF 189 and 
VEGF 206 resulting from alternative splicing of VEGF RNA. The amino acid sequences of these alternative species are 
15% similar and 5% dissimilar to the sequence set forth in sequence ID#2. 

Documents 9 and 10 

US Patent Numbers: 5219739 and 5 194596 disclose amino acid sequences of human and bovine VEGF^i and VEGFi 20 
species. These sequences are at least 15% similar and 5% dissimilar to the sequence set forth in sequence ID#2. 

Document 11 

Science, Volume 246, 8 December 1989 page 1309-12. 
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Document 7 



Document 8 
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Document 1 1 in figure 1 discloses a VEGF species that is 15% similar and 5% dissimilar to the sequence set forth in 
sequence ID#2. 

Documents 12 and 13 

Science Volume 264, 8 December 1989 page 1306-1309 and US Patent number 5332671. 

Both these documents disclose bovine VEGF 16 4 amino acid species that is at least 15% similar and 5% dissimilar to the 
sequence ID#2. 

Document 14 

Biochimicaet Biophysica Acta 1260 (1995) 235-238. 

Document 14 discloses Porcine vascular entothelial growth factor that has an amino acid sequence that is 15% similar and 
5% dissimilar to the sequence set forth in sequence ID#2. 

Claims 1-41 as drafted include within their scope documents disclosing proteinaceous molecules or recombinant molecules 
comprising amino acid sequence having at least 15% similarity and 5% dissimilarity to the sequence set forth in sequence 
ID No. 2 and exhibit at least one property in common with VEGF. 

All the above documents numbered 1-14 disclose proteins having at least 15% amino acid sequence similarity and 5% 
dissimilarity to the sequence set forth in sequence ID No. 2 and exhibit at least one property in common with VEGF. 
Therefore all these documents fall within the scope of claims 1-41 rendering them not novel and not inventive. 

Claim 33 defining a nucleic acid molecule encoding murine homologue of human VEGF comprising nucleotide sequence 
of Figure 9 lacks an inventive step in the light of Document 7 disclosing mouse and rat VEGF protein sequences. 

The citation discloses mouse and rat VEGF genes and similarity to the sequence of Figure 9. Common General 
Knowledge of Skilled addressee would use probes based on known sequence to get murine homologue of human VEGF. 
Therefore claim 33 lacks an inventive step. 
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Last two claims have been numbered 41. 
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Vni. Certain observations on the international application 



The following observations on the clarity of the claims, description, and drawings or on the question whether the claims are fully 
supported by the description, are made: 



Claims defining portions, parts, fragments and derivatives of the present sequences are mdeterminate in scope. 
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A NOVEL GROWTH FACTOR AND A GENETIC SEQUENCE 

ENCODING SAME 

The present invention relates generally to an isolated molecule having vascular 
endothelial growth factor-like properties and to a genetic sequence encoding same. The 
molecule will be useful in the development of a range of therapeutics and diagnostics 
useful in the treatment, prophylaxis and/or diagnosis of conditions requiring enhanced 
or diminished vasculature and/or vascular permeability. The molecule of the present 
invention is also a useful effector of primary and central neurons and is capable of 
inducing astroglial proliferation. 

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for 
the nucleotide and amino acid sequences referred to in the specification are defined 
following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but 
not the exclusion of any other element or integer or group of elements or integers. 

Vascular endothelial growth factor (hereinafter referred to as "VEGF"), also known as 
vasoactive permeability factor, is a secreted, covalently linked homodimeric glycoprotein 
that specifically activates endothelial tissues (Senger et al. t 1993). A range of functions 
have been attributed to VEGF such as its involvement in normal angiogensis including 
formation of the corpus luteum (Yan et al., 1993) and placental development (Sharkey 
et al., 1993), regulation of vascular permeability (Senger et aL, 1993), inflammatory 
angiogenesis (Sunderkotter et al, 1994) and autotransplantation (Dissen et aL, 1994) and 
human diseases such as tumour promoting angiogenesis (Folkman & Shing, 1992), 
rheumatoid arthritis (Koch et aL, 1994) and diabetes related retinopathy (Folkman & 
Shing, 1992). 
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VEGF is, therefore, an important molecule making it a potentially valuable target for 
research into therapeutics, prophylactics and diagnostic agents based on VEGF or its 
activities. There is also a need to identify homologues or otherwise related molecules 
for use as an alternative to VEGF or in conjunction with VEGF. 

In work leading up to the present invention, the inventors sought the multiple endocrine 
neoplasia type I susceptibility gene (MEN1). Surprisingly, the inventors discovered that 
a genetic sequence excluded as a candidate for the MEN1 gene was nevertheless a new 
growth factor having some similarity to VEGF. Furthermore, the growth factor of the 
present invention is an effector molecule for primary and central neurons. 

Accordingly, one aspect of the present invention comprises a biologically isolated 
proteinaceous molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

Another aspect of the present invention provides a biologically isolated proteinaceous 
molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to all or part of the amino acid sequence set forth 
in SEQ ID NO:2; 

(ii) exhibits at least one property in common with VEGF. 

A related aspect of the present invention contemplates a biologically isolated 
proteinaceous molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with flt-\fflk-\ family of receptors; 
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(c) ability to induce cell migration, cell survival and/or an increase in 
intracellular levels of alkaline phosphatase. 

By "biologically isolated" is meant that the molecule has undergone at least one step of 
purification from a biological source. Preferably, the molecule is also biologically pure 
meaning that a composition comprises at least about 20%, more preferably at least about 
40%, still more preferably at least about 65%, even still more preferably at least about 
80-90% or greater of the molecule as determined by weight, activity or other convenient 
means, relative to other compounds in the composition. Most preferably, the molecule 
is sequencably pure. 

Another preferred aspect of the present invention provides the molecule in recombinant 
form. 

According to this aspect of the present invention, there is provided a recombinant 
molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

A related aspect of the present invention is directed to a recombinant molecule having 
the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to all or part of the amino acid sequence set forth 
in SEQ ID NO:2; 

(ii) exhibits at least one property in common with VEGF. 

A further related aspect of the present invention contemplates a recombinant molecule 
having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 

NO:2; 
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(ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with flt-\lflk-\ family of receptors; 

(c) ability to induce cell migration, cell survival and/or an increase in 
5 intracellular levels of alkaline phosphatase. 

The present invention also contemplates genomic or partial genome clones encoding a 
proteinaceous molecule having at least about 15% amino acid similarity but at least 
about 5% dissimilarity to SEQ ID NO:l. 

10 

The amino acid sequence set forth in SEQ ID NO:2 corresponds to human VEGF 
(referred to herein as n VEGF 165 "). Accordingly, the molecule of the present invention 
is VEGF-like or is a homologue of VEGF but comprises an amino acid sequence which 
is similar but non-identical to the amino sequence of VEGF. Although the present 

15 invention is exemplified using a human VEGF-like molecule, this is done with the 
understanding that the instant invention contemplates the homologous molecule and 
encoding sequence from other mammals such as livestock animals (e.g. sheep, pigs, 
horses and cows), companion animals (e.g. dogs and cats) and laboratory test animals 
(e.g. mice, rats, rabbits and guinea pigs) as well as non-mammals such as birds (e.g. 

20 poultry birds), fish and reptiles. In a most preferred embodiment, the VEGF-like 
molecule is of human origin and encoded by a gene located at chromosome 1 lql3. The 
present invention extends, therefore, to the genomic sequence or part thereof encoding 
the subject VEGF-like molecule. 



25 Preferably, the percentage similarity is at least about 30%, more preferably at least about 
40%, still more preferably at least about 50%, still even more preferably at least about 
60-70%, yet even more preferably at least about 80-95% to all or part of the amino acid 
sequence set forth in SEQ ID NO:2. 

30 In a particularly preferred embodiment, the VEGF-like molecule of the present invention 
comprises a sequence of amino acids as set forth in SEQ ID NO:4 or is a part, fragment, 
derivative or analogue thereof. Particularly preferred similarities include about 19-20%, 
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and 29-30%. Reference herein to derivatives also includes splice variants. Accordingly, 
the present invention extends to splice variants of SOM1 75. Examples of splice variants 
contemplated by the present invention include but are not limited to variants with an 
amino acid sequence substantially as set forth in at least one of SEQ ID NO:6, SEQ ID 
5 NO:8 and/or SEQ ID NO: 10 or mutants or derivatives or further splice variants thereof. 

Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:4 or having at 
10 least about 15% similarity to all or part thereof provided that said amino acid 

sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

15 Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:6 or having at 
least about 15% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 

20 set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

Another embodiment provides a recombinant molecule having the following 
characteristics: 

25 (i) an amino acid sequence substantially as set forth in SEQ ID NO: 8 or having at 
least about 15% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 
(ii) exhibits at least one biological property in common with VEGF. 

30 

Another embodiment provides a recombinant molecule having the following 
characteristics: 
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(i) an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having at 
least about 15% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acH sequence 
set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

Such properties of VEGF include at least one of: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) an ability to interact with flt~\lflk-\ family of receptors; 

(c) an ability to induce cell migration, cell survival and/or an increase in intracellular 
levels of alkaline phosphatase. 

In accordance with the present invention, a preferred similarity is at least about 40%, 
more preferably at least about 50% and even more preferably at least about 65% 
similarity. 

Still a further aspect of the present invention contemplates a peptide fragment 
corresponding to a portion of the amino acid sequence set forth in SEQ ID NO:4 or a 
splice variant thereof such as set forth in SEQ ID NO:6, SEQ ID NO:8 or SEQ ID 
NO: 10 or a chemical equivalent thereof. The biologically isolated or recombinant 
molecule of the present invention may be naturally glycosylated or may comprise an 
altered glycosylation pattern depending on the cells from which it is isolated or 
synthesised. For example, if produced by recombinant means in prokaryotic organisms, 
the molecule would be non-glycosylated. The molecule may be a full length, naturally 
occurring form or may be a truncated or otherwise derivatised form. 

Yet another aspect of the present invention is directed to a nucleic acid molecule 
encoding the VEGF-like molecule herein described. More particularly, the present 
invention provides a nucleic acid molecule comprising a sequence of nucleotides 
substantially as set forth in SEQ ID NO:3 or having at least 15% similarity to all or part 
thereof or being capable of hybridising under low stringency conditions to a reverse 
complement of the nucleotide sequence as set forth in SEQ ID NO:3 provided that the 
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nucleic acid sequence having at least 15% similarity but at least 30% dissimilarity to the 
nucleotide sequence as set forth in SEQ ID NO:3. The nucleotide sequence set forth in 
SEQ ID NO:3 is also referred to herein as "SOM175". Preferably, the percentage 
dissimilarity is about 35%, more preferably about 39% and even more preferably about 
5 40-50% or greater. 

For the purposes of defining the level of stringency, reference can conveniently be made 
to Sambrook et al (1989) at pages 9.47-9.51 which is herein incorporated by reference 
where the washing steps disclosed are considered high stringency. A low stringency is 

10 defined herein as being in 4-6X SSC/0.1-0.5% w/v SDS at 37-45°C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridisation, 
alternative conditions of stringency may be employed such as medium stringent 
conditions which are considered herein to be 1-4X SSC/0.25-0.5% w/v SDS at > 45°C 
for 2-3 hours or high stringent conditions considered herein to be 0.1- IX SSC/0.1% w/v 

15 SDS at 60°C for 1-3 hours. 

The present invention further contemplates a nucleic acid molecule which encodes a 
VEGF-like molecule as hereinbefore described having at least 1 5% nucleotide sequence 
homology to SEQ ID NO: 3. Preferred levels of homology include at least about 40%, 
20 more preferably around 60-70%. 

The present invention is further directed to the murine homologue of human VEGF 
(referred to herein as "mVRF"). The mVRF has approximately 85% identity and 92% 
conservation of amino acid residues over the entire coding region compared to human 
25 VEGF. The mVRF is encoded by a nucleic acid molecule comprising a nucleotide 
sequence substantially as set forth in Figure 9. 

The VEGF-like molecule of the present invention will be useful in the development of 
a range of therapeutic and/or diagnostic applications alone or in combination with other 
30 molecules such as VEGF. The present invention extends, therefore, to pharmaceutical 
compositions comprising the VEGF-like molecule or parts, fragments, derivatives, 
homologues or analogues thereof together with one or more pharmaceutically acceptable 
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carriers and/or diluents. Furthermore, the present invention extends to vectors 
comprising the nucleic acid sequence set forth in SEQ ID NO:3 or having at least about 
15%, more preferably about 40% and even more preferably around 60-70% similarity 
thereto but at least 30% and more preferably around 39% dissimilarity thereto and host 
cells comprising same. In addition, the present invention extends to ribozymes and 
antisense molecules based on SEQ ID NO:3 as well as neutralizing antibodies to the 
VEGF-like molecule. Such molecules may be useful in ameliorating the effects of, for 
example, over expression of VEGF-like genes leading to angiogenesis or vascularization 
of tumours. 

Another aspect of the present invention contemplates a method of inducing astroglial 
proliferation in a mammal, said method comprising administering to said mammal an 
effective amount of a recombinant proteinaceous molecule having the characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 

said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

Preferably, the recombinant proteinaceous molecule comprises the amino acid sequence 
set forth in SEQ ID NO:3 or SEQ ID NO:6. 

A further aspect of the present invention provides a method of promoting neural survival 
and/or proliferation in a mammal, said method comprising administering to said mammal 
an effective amount of a recombinant proteinaceous molecule having the characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 
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said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

Preferably, the recombinant proteinaceous molecule comprises the amino acid sequence 
5 set forth in SEQ ID NO:3 or SEQ ID NO:6. 

The present invention also contemplates antibodies to the VEGF-like molecule or nucleic 
acid probes to a gene encoding the VEGF-like molecule which are useful as diagnostic 
agents. 

10 

The present invention is further described by reference to the following non-limiting 
Figures and/or Examples. 

In the Figures: 

15 

Figure 1 Nucleotide sequence [SEQ ID NO:l] and corresponding amino acid 
sequence [SEQ ID NO:2] of VEGF 165 . 

Figure 2 Nucleotide sequence [SEQ ID NO:3] and corresponding amino acid 
20 sequence [SEQ ID NO:4] of SOM175. 

Figure 3 Results of BLAST search with SOM175 protein sequence. 
Figure 4 BESTFIT alignment of VEGF cDNA and SOM175 cDNA. 

25 

Figure 5 Multiple alignment of VEGF 165 with SOM175 and its splice variants at 
the nucleotide level. 

Figure 6 Multiple alignment of VEGF 165 with SOM175 and its splice variants at 
30 the amino acid level. 



Figure 7 Diagrammatic representation of SOM175 and its splice variants. 
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Figure 8(a) Diagrammatic representation of genomic structure of human SOM175 
genomic showing exon/intron map. 

Figure 8(b) Diagrammatic representation of genomic structure of human SOM175 
5 showing exon/intron boundries. 

Figure 9 Nucleotide and predicted peptide sequences derived from mVRF cDNA 
clones. Numbering of nucleotides are given on the left, starting from the A of the 
initiation codon. Amino acids are numbered on the right, starting from the first residue 

10 of the predicted mature protein after the putative signal peptide has been removed. The 
alternately spliced region is double underlined and the resulting peptide sequence from 
each mRNA is included. A potential polyadenylation signal is indicated in boldface. 
Start and stop codons of mVRF 167 and mVRF lg5 are underlined and a polymorphic AC 
repeat in the 3 ' UTR is indicated by a stippled box. The positions of intron/exons 

1 5 boundaries are indicated by arrowheads. 

Figure 10 BESTFIT alignments of human and murine VRF protein isoforms. A: 
mVRF 167 and hVRF 167 . B: mVRF 186 and hVRF, g6 from the point where the sequences 
diverge from the respective 167 amino acid isoforms. Amino acid identities are marked 
20 with vertical bars and conserved amino acids with colons. An arrow marks the predicted 
signal peptide cleavage site of human and mouse VRF. 

Figure 11 BESTFIT alignment of mVRF 167 and mVEGF 188 (Breier et a!, 1992) 
peptide sequences. An arrow marks the signal peptide cleavage site of mVEGF. 
25 Identical amino acids are indicated by vertical bars and conservative substitutions by 
colons. Numbering of amino acids is as described in the legend to Figure 9. 

Figure 12 Comparison of gene structure between VRF (a generic VRF gene is 
shown since the intron/exon organisation of the mouse and human homologues is almost 
30 identical) and other members of the human VEGF/PIGF/PDGF gene family. Exons are 
represented by boxes. Protein coding regions and untranslated regions are shown by 
filled and open sections respectively. The hatched region in VRF indicates the 
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additional 3' UTR sequence formed by alternate splicing of the VRF 186 isoform. 
Potential alternate splice products of each gene are shown. 



Figure 13 Autoradiogram of a Northern blot of total RNA from various adult mouse 
5 tissues (as indicated) hybridised with an mVRF cDNA clone. A major transcript of 1.3 
kb was detected in all samples. 



Figure 14 Film autoradiographs (A-C) and dark-field micrographs (D-E) illustrating 
the expression pattern of mVRF and mRNA in the mouse. In the E14 mouse embryo 

10 (A) positive signals are present over the developing heart (Ha) and cerebral cortex (Cx). 
A low background signal is also present over other tissues in the section. In the El 7 
embryo (B) and the heart (Ha) is clearly visible due to a strong hybridisation signal. An 
equally strong signal is present over brown adipose tissue (Fa) in the back and around 
the thoracic cage. A moderate hybridisation signal is present over the spinal cord (SC) 

15 and the tongue (T). The background signal is reduced compared with the E14 embryo. 
In the young adult mouse (C-D), positive signals are present over the heart (Ha) and 
adipose tissue (Fa) around the thoracic cage, while, for example, the lungs (Lu) are 
unlabeled). The hybridisation signal over the heart is evenly distributed over the entire 
left ventricle, including papillary muscles (D). In the El 7 heart hybridised with an 
20 excess of cold probe, no positive signal is present (E). Scale bars = 0.5 mm (A), 1.2 
mm (B), 1 mm (C), 0.3 mm (D), 0.1 mm (E). 



Figure 15 Dark - (A and C) and bright-field (B and D) micrographs showing mVRF 
mRNA expression in mouse adipose tissue (A-B) and spinal cord (C-D). A strong 

25 hybridisation signal is present over fat (A), as shown by the strong labeling in Sudan 
black stained sections (B). A weak signal is present also in skeletal muscle (M in A-B). 
In the adult spinal cord (C) the mVRF probes gave a neuronal staining pattern over the 
gray matter. Toloudine counterstaining showing that motoneurons in the ventral horn 
(D), interneurons in the deep part of the dorsal horn and around the central canal (not 

30 shown) where largely positive for mVRF mRNA. Scale bars = 0. 1 mm (A), 0. 1 mm 
(B), 0.25 mm (C), 0.015 mm (D). 
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Figure 16 Effect of VEGF on embryonic day 8 (E8) chick sensory neurons as 
determined by % survival, % neurite outgrowth and average neurite length (^im). 

Figure 17 Effects of VEGF and SOM175 on chick glia. Tested were CNS glial, 
5 peripheral glia and CNS oligodendrocytes. 

Figure 18 Effect of various SOM175 proteins on mouse astroglial cells. ■ 3 H (cpm) 



1. 


FGF-2 (10 ng/ml) positive control 


2. 


SOMaX6* 1 ng/ml 


3. 


SOMaX6 10 ng/ml 


4. 


SOMaX6 100 ng/ml 


5. 


SOMaX6 1000 ng/ml 


6. 


SOMaX6 1000 ng/ml, no heparin 


7. 


SOMX6** 1 ng/ml 


8. 


SOMX6 10 ng/ml 


9. 


SOMX6 100 ng/ml 


10. 


SOMX6 1000 ng/ml 


11. 


SOMX6 1000 ng/ml, no heparin 



* This refers to SOM175 absent exon 6; 
20 ** This refers to SOM175. 

Figure 19 Effect of various SOM175 proteins on mouse oligodenroglial cells. ■ 3 H 
(cpm) 





1. 


FGF-2 (10 ng/ml) positive control 


25 


2. 


SOMaX6* 1 ng/ml 




3. 


SOMaX6 10 ng/ml 




4. 


SOMaX6 100 ng/ml 




5. 


SOMaX6 1000 ng/ml 




6. 


SOMaX6 1000 ng/ml, no heparin 


30 


7. 


SOMX6** 1 ng/ml 




8. 


SOMX6 10 ng/ml 




9. 


SOMX6 100 ng/ml 
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10. SOMX6 1000 ng/ml 

11. SOMX6 1000 ng/ml, no heparin 
This refers to SOM175 absent exon 6; 

** This refers to SOM175. 

5 

Figure 20 Effect of various SOM175 proteins on mouse forebrain neurons. B % 



survival 




1. 


FGF-2 (10 ng/ml) positive control 


2. 


SOMaX6* 1 ng/ml 


3. 


SOMaX6 10 ng/ml 


4. 


SOMaX6 100 ng/ml 


5. 


SOMaX6 1000 ng/ml 


6. 


SOMaX6 1000 ng/ml, no heparin 


7. 


SOMX6** 1 ng/ml 


8. 


SOMX6 10 ng/ml 


9. 


SOMX6 100 ng/ml 


10. 


SOMX6 1000 ng/ml 


11. 


SOMX6 1000 ng/ml, no heparin 



This refers to SOM175 absent exon 6; 



20 This refers to SOM175. 
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TABLE 1 

SUMMARY OF SEQUENCE IDENTITY NUMBERS 



5 



SEQ ID NO:l 


Nucleotide sequence of VEGF 165 


SEQ ID NO:2 


Amino acid sequence of VEGF 165 


SEQ ID NO:3 


Nucleotide sequence of SOM175 (VEGF-like molecules) 


SEQ ID NO:4 


Amino acid sequence of SOM175 


SEQ ID NO:5 


Nucleotide sequence of SOM175 absent exon 6 


SEQ ID NO:6 


Amino acid sequence of SOM175 absent exon 6 


SEQ ID NO:7 


Nucleotide sequence of SOM175 absent exon 6 and exon 7 


SEQ ID NO:8 


Amino acid sequence of SOM175 absent exon 6 and exon 7 


SEQ ID NO:9 


Nucleotide sequence of SOM175 absent exon 4 


SEQ ID NO: 10 


Amino acid sequence of SOM175 absent exon 4 


SEQ ID NO: 11 


Oligonucleotide 


SEQ ID NO: 12 


Oligonucleotide 


SEQ ID NO:13 


Oligonucleotide 


SEQ ID NO:14 


Oligonucleotide 



20 

EXAMPLE 1 

Human cDNA clones 

The original SOM175 cDNA was isolated by screening a human foetal brain library 
25 (A^apII, Stratagene) with the cosmid Dl 1S750 (Larsson et al f 1992). The plasmid was 
excised "in vivo" and a single 1.1 kb cDNA was obtained. Three independent SOM175 
cDNAs clones were also isolated from a human foetal spleen library (Strategane, Uni- 
zap) using the above-mentioned SOM175 insert as a probe. Three clones were obtained: 
SOM175-4A, -5 A and -6 A. SOM175-5A is an alternately spliced clone with exon 4 
30 being absent (SOM175-e4). These library screens were performed using hybridisation 
conditions recommended by the manufacturer of the library (Stratagene) and random 
primed insert of SOM175. 
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Two partial human SOM175 cDNAs have also isolated from a AGT1 1 human melanoma 
cell line A2058 library (Clontech) cDNA library screens were performed using 
hybridisation conditions described by Church and Gilbert, 1984). In each case, the 
probe was generated by random priming of a PCR product derived from SOM175 (18f- 
5 700r). 

Mouse cDNA Clones 

Human SOM175 was also used to screen a mouse neonatal whole brain cDNA library 
(Unizap, Stratagene). Four non-chimeric clones were isolated: M175-A, B, C, D. All 
10 clones were partial cDNAs and M175-C contained several introns. Three of these 
cDNAs lacked the exon 6. 

Another clone referred to as Ml was completely sequenced and was found to contain 
the full open reading frame plus part of the 5'utr and total 3'utr. 

15 

EXAMPLE 2 
DNA SEQUENCE ANALYSIS 
The entire sequence of the cDNA clone (SOM175) was compiled and is shown in 
Figure 2 with its corresponding amino acid sequence. This sequence was screened for 
20 open reading frames using the MAP program (GCG, University of Wisconsin). A single 
open reading frame of 672bp was observed (see Figure 2). There appears to be little 5' 
untranslated sequences (2bp). The 3' untranslated region appears to be complete as it 
includes a poly-adenylation signal and poly-A tail. 

25 Database homology searches were performed using the BLAST algorithm (run at NCBI, 
USA). This analysis revealed homology to several mammalian forms of VEGF (see 
Figure 3). The amount of homology between SOM175 and human VEGF 165 was 
determined using the BESTFIT program (GCG, University of Wisconsin; see Figures 
4 and 5). Nucleotide homology was estimated at 69.7% and protein homology was 

30 estimated as at least 33.3% identity and 52.5% conservation using BESTFIT analysis. 
BLAST analysis on nucleotide sequences revealed the almost complete match to a 
human expressed sequence tag EST06302 (Adams et al, 1993). 
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These data indicate that SOM175 encodes a growth factor that has structural similarities 
to VEGF. Both genes show start and stop codons in similar positions and share discrete 
blocks of homology. All 8 cysteines as well as a number of other VEGF residues 
believed to be involved in dimerisation are conserved. These residues are Cysteine-47, 
5 Proline-70, Cysteine-72, Valine-74, Arginine-77, Cysteine-78, Glycine-80, Cysteine-81, 
Cysteine-82, Cysteine-89, Proline-91, Cysteine- 122 and Cysteine- 124 and are shown in 
Figure 6. Given the structural conservation between VEGF and the SOM175 gene 
product it is also possible that they share functional similarities. It is proposed that 
SOM175 encodes a VEGF-like molecule that shares some properties with VEGF but has 
10 unique properties of its own. The nucleotide sequence and corresponding amine* acid 
sequence of VEGF 165 is shown in Figure 1. 



EXAMPLE 3 

The percentage similarity and divergence between VEGF 165 family and SOM175 family 
15 (protein) were analysed using the Clustal method, MegAlign Software, DNASTAR, 
Wisconsin. The results are shown in Tables 2.1 and 2.2. The alternatively spliced 
forms of SOM175 are abbreviated to SOM715-e6 where all of exon 6 is deleted; 
SOM715-e6 and 7 where all of exons 6 and 7 are deleted; and SOM175-e4 where all 
of exon 4 is deleted. The spliced form of SOM175 are shown in Figure 7. Genomic 
20 maps of SOM175 showing intron/exon boundaries are shown in Figure 8a and 8b. 
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Table 2.1 

A Percent nucleotide similarity between splice variants of SOM175 and 
5 human VEGF,*, 





VEGF 16S SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF 165 


*** 34.9 


39.7 


41.4 


37.0 


SOM175 


*** 


98.9 


95.1 


99.2 


SOM175-e6 




*** 


98.8 


84.0 


SOM175-e6&7 






*** 


80.3 


SOM175-e4 








*** 



15 
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B Percent nucleotide divergence between splice variants of SOM175 and 
human VEGF,~ 



5 




VEGF 165 SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 




VEGF 165 


*** 41.7 


41.6 


41.7 


41.8 




SOM175 


*** 


0.2 


0.2 


0.0 




SOM175-e6 




*** 


0.0 


0.2 


10 


SOM175-e6&7 






*** 


0.3 




SOM175-e4 








*** 



Table 2.2 

15 A Percent amino acid identity between splice variants of SOM175 and 
human VEGF,~ 





VEGF 165 


SOM175 


SOM175-e6 


SOM175-e6&7 SOM175-e4 


VEGF 165 


*** 


31.4 


42.3 


33.5 


40.6 


SOM175 




*** 


74.7 


73.7 


99.1 


SOM175-e6 






#** 


76.8 


99.1 


SOM175-e6&7 








*** 


99.1 


SOM175-e4 










*%* 



25 
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B Percent amino acid divergence between splice variants of SOM175 and 
human VEGF,« 



5 




VEGF, 65 SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 




VEGF 165 


*** 65.7 


55.4 


54.6 


57.4 




SOM175 


*** 


19.9 


4.2 


0.0 




SOM175-e6 




*** 


0.0 


0.0 


10 


SOM175-e6&7 






*** 


0.0 




SOM175-e4 








*** 



15 EXAMPLE 4 

BIO ASS AYS TO DETERMINE THE FUNCTION OF SOM175 

Assays are conducted to evaluate whether SOM175 has similar activities to VEGF on 
endothelial cell function, angiogenesis and wound healing. Other assays are 
performed based on the results of receptor binding distribution studies. 

20 

Assays of endothelial cell function 

Endothelial cell proliferation. Endothelial cell growth assays as described in Ferrara 
& Henzel (1989) and in Gospodarowicz et al (1989). 

25 Vascular permeability assay. This assay, which utilises the Miles test in guinea pigs, 
will be performed as described in Miles & Miles (1952). 

Cell adhesion assay. The influence of SOM175 on adhesion of polymorphs to 
endothelial cells is analysed. 

30 



Chemotaxis. This is performed using the standard Boyden chamber chemotaxis 
assay. 
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Plasminogen activator assay. Endothelial cells are tested for plasminogen activator 
and plasminogen activator inhibitor production upon addition of SOM175 (Pepper et 
al (1991)). 

5 Endothelial cell migration assay. The ability of SOM175 to stimulate endothelial 
cells to migrate and form tubes is assayed as described in Montesano et al (1986). 

Angiogenesis Assay 

SOM175 induction of an angiogenic response in chick chorioallantoic membrane is 
10 evaluated as described in Leung et al (1989). 

Possible neurotrophic actions of SOM175 are assessed using the following assays: 

Neurite outgrowth assay and gene induction (PC12 cells) 

15 PC 12 cells (a phaeochromocytoma cell line) respond to NGF and other neurotrophic 
factors by developing the characteristics of sympathetic neurons, including the 
induction of early and late genes and the extension of neurites. These cells are 
exposed to SOM175 and their response monitored (Drinkwater et al (1991); and 
Drinkwater et al (1993)). 

20 

Cultured neurons from the Peripheral Nervous System (PNS) 

Primary cultures of the following PNS neurons are exposed to SOM175 and 

monitored for any response: 

- sensory neurons from neural crest and dorsal root ganglia 
25 - sympathetic neurons from sympathetic chain ganglia 

- placode derived sensory neurons from nodose ganglia 

- motoneurons from spinal cord 

The assays are described in Suter et al (1992) and in Marinou et al (1992). 



30 



Where an in vitro response is observed, in vivo assays for properties such as uptake 
and retrograde transport are performed as described in Hendry et al (1992). 
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Nerve regeneration (PNS) 

Where neurotrophic effects of SOM175 are observed, its possible role in the 
regeneration of axotomised sensory neurons, sympathetic neurons and motoneurons is 
analysed by the methods of Otto et al (1989); Yip et al (1984) and Hendry et al 
5 (1976). 

Actions of SOM175 on CNS neurons 

The ability of SOM175 to promote survival of central nervous system neurons is 
analysed as described in Hagg et al (1992); Williams et al (1986); Hefti (1986) and 
10 Kromer (1987). 

Wound Healing 

The ability of SOM175 to support wound healing are tested in the most clinically 
relevant model available, as described in Schilling et al (1959) and utilised by Hunt 
15 et al (1967). 

The Haemopoietic System 

A variety of in vitro and in vivo assays on specific cell populations of the 
haemopoietic system are available and are outlined below: 
20 Stem Cells 
Murine 

A variety of novel in vitro murine stem cell assays have been developed using 
FACS-purified cells: 

25 (a) Repopulating Stem Cells 

These are cells capable of repopulating the bone marrow of lethally irradiated mice, 
and have the Lin", Rh hi , Ly-6A/E + , c-kit + phenotype. The test substance is tested on 
these cells either alone, or by co-incubation with multiple factors, followed by 
measurement of cellular proliferation by 3 H thymidine incorporation. 



30 
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(b) Late Stage Stem Cells 

These are cells that have comparatively little bone marrow repopulating ability but 
can generate D13 CFU-S. These cells have the Lin - , Rh hi , Ly-6A/E + , c-kit + 
5 phenotype. The test substance is incubated with these cells for a period of time, 
injected into lethally irradiated recipients, and the number of D13 spleen colonies 
enumerated. 

(c) Progenitor-Enriched Cells 

1 0 These are cells that respond in vitro to single growth factors, and have the Lin", Rh hi , 
Ly-6A/E + , c-kit + phenotype. This assay will show if SOM175 can act directly on 
haemopoietic progenitor cells. The test substance is incubated with these cells in 
agar cultures, and the number of colonies enumerated after 7-14 days. 

1 5 Atherosclerosis 

Smooth muscle cells play a crucial role in the development or initiation of 
atherosclerosis, requiring a change in their phenotype from a contractile to a synthetic 
state. Macrophages, endothelial cells, T lymphocytes and platelets all play a role in 
the development of atherosclerotic plaques by influencing the growth and phenotypic 
20 modulations of smooth muscle cell. An in vitro assay that measures the proliferative 
rate and phenotypic modulations of smooth muscle cells in a multicellular 
environment is used to assess the effect of SOM175 on smooth muscle cells. The 
system uses a modified Rose chamber in which different cell types are seeded onto 
opposite coverslips. 

25 

Effects of SOM175 on bone 

The ability of SOM175 to regulate proliferation of osteoblasts is assayed as described 
in Lowe et al (1991). Any effects on bone resorption are assayed as described in 
Lowe et al (1991). Effects on osteoblast migration and changes in intracellular 
30 molecules (e.g. cAMP accumulation, alkaline phosphatase levels) are analysed as 
described in Midy et al (1994). 
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Effects on skeletal muscle cells 

Effects of SOM175 on proliferation of myoblasts and development of myotubes can 
be determined as described by Ewton et al (1980) and by Gospodarowicz et al 
(1976). 

5 

EXAMPLE 5 
CLONING MURINE VEGF DNA 

Isolation of cDNAs 

Murine VRF (mVRF) clones were selected from a lambda Zap new born whole brain 
10 cDNA library (Stratagene). Primary phage from high density filters (5 x 10 4 

pfu/plate) were identified by hybridisation with a 682bp 32 P-labelled probe generated 
by PCR from an hVRF cDNA (pSOM175) as described above. Hybridisation and 
stringent washes of nylon membranes (Hybond-N) were carried out at 65°C under 
conditions described by Church and Gilbert (1984). Positive plaques were picked, 
1 5 purified and excised in vivo to produce bacterial colonies containing cDNA clones in 
pBluescript SK-. 

Isolation of genomic clones 

Genomic clones were isolated from a mouse strain SV/129 library cloned in the 
20 lambda Fix II vector (Stratagene). High density filters (5 x 10 4 pfu/filter) were 
screened with a 563 bp 32 P-labelled probe generated by PCR amplification of the 
nucleotide 233-798 region of the mVRF cDNA (see Figure 9). Positive clones were 
plugged and re-screened with filters containing 400-800 pfu. Large scale phage 
preparations were prepared using the QIAGEN lambda kit or by ZnCl 2 purification 
25 (Santos, 1991). 

Nucleotide sequencing and analysis 

cDNAs were sequenced on both strands using a variety of vector-based and internal 
primers with Applied Biosystems Incorporated (ABI) dye terminator sequencing kits 
30 according to the manufacturer's specifications. Sequences were analysed on an ABI 
Model 3 73 A automated DNA sequencer. Peptide homology alignments were 
performed using the program BESTFIT (GCG, Wisconsin). 
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Identification of intron/exon boundaries 

Identification of exon boundaries and flanking regions was carried out using PCR 
with mouse genomic DNA or mVRF genomic lambda clones as templates. The 
primers used in PCR to identify introns were derived from the hVRF sequence and to 
5 allow for potential human-mouse sequence mismatches annealing temperatures 5- 
10°C below the estimated T m were used. All PCR products were sized by agarose 
gel electrophoresis and gel purified using QIAquick spin columns (Qiagen) and the 
intron/exon boundaries were sequenced directly from these products. In addition, 
some splice junctions were sequenced from subcloned genomic fragments of MVRF. 
10 Intron/exon boundaries were identified by comparing cDNA and genomic DNA 
sequences. 

Northern analysis 

Total cellular RNA was prepared from a panel of fresh normal adult mouse tisues 
15 (brain, kidney, liver, muscle) using the method of Chomczynski and Sacchi (1987). 
20[ig of total RNA were electrophoresed, transferred to a nylon membrane (Hybond 
N, Amersham) and hybridised under standard conditions (Church & Gilbert, 1984). 
Filters were washed at 65°C in O.lxSSC (20xSSC is 3M NaCl/0.3M trisodium 
citrate), 0.1% SDS and exposed to X-ray film with intensifying screens at -70°C for 
20 1-3 days. 

Characterisation of mVRF cDNAs 

Murine VRF homologues were isolated by screening a murine cDNA library with an 
hVRF cDNA clone. Five clones of sizes varying from 0.8-1.5 kb were recovered 
25 and sequenced. The cDNA sequences were complied to give a full length 1041 bp 
cDNA sequence covering the entire open reading frame (621 bp or 564 bp depending 
on the splice form, see below) and 3' UTR (379 bp), as well as 163 bp of the 5' 
UTR (Figure 9). 

30 The predicted initiation codon matched the position of the start codon in hVRF. One 
other out of frame ATG was located at position -47 and two termination codons were 
observed upstream (positions -9 and -33, respectively) and in-frame with the putative 
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initiation codon. 

The predicted N-terminal signal peptide of hVRF appears to be present in mVRF 
with 81% identity (17/21 amino acids). Peptide cleavage within mVRF is expected 
to occur after reside 21 (Figure 10). These data suggest that mature mVRF is 
secreted and could therefore conceivably function as a growth factor. 

As with hVRF, two open reading frames (ORFs) were detected in cDNAs isolated by 
library screening. Four of five clones were found to be alternatively spliced and 
lacked a 101 bp fragment homologous to exon 6 of hVRF. The predicted peptide 
sequences of the two isoforms of mVRF were determined and aligned with the 
corresponding human isoforms (Figure 10). 

The message encoding mVRF 186 contains a 621 bp ORF with coding sequences 
terminating at position +622, towards the end of exon 7 (Figure 9). The smaller 
message encoding mVRF l67 actually terminates downstream of the +622 TAG site 
due to a frame shift resulting from splicing out of the 101 bp exon 6 and the 
introduction of a stop codon (TGA) at position +666, near the beginning of exon 8 
(Figure 9). 

The mVRF 186 protein has strong homology to the amino and central portions of 
VEGF while the carboxyl end is completely divergent an is alanine rich. mVRF 167 
possesses these similarities and also maintains homology to mVEGF right through to 
the C-terminus (Figure 11). The overall homology of mVRF 167 to hVRF 167 was 
85% identity and 92% similarity, respectively (Figure 10). Likewise, homology 
between mVRF 167 and mVEGF (Breier et al 9 1992) was 49% identity and 71% 
conservative amino acid substitution, respectively (Figure 11). 

A canonical vertebrate polyadenylation signal (AATAAA) (Birnstiel et al 9 1986) was 
not present in the mVRF cDNA, however, the closely matching sequence GAT AAA 
is present at similar positions in both mouse and human VRF cDNAs (Figure 9). In 
contrast to hVRF, mVRF was found to contain an AC dinucleotide repeat at the 
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extreme 3' end of the 3' UTR (nucleotide positions 998 to 1011, Figure 9). 
Polymorphism of this repeat region was observed between some of the mVRF 
cDNAs, with the number of dinucleotides varying from 7 to 11. 

5 Genomic characterisation of mVRF 

Intron/exon boundaries (Table 3) were mapped using primers which flanked 
sequences homologous to the corresponding hVRF boundaries. Introns I, III, IV and 
VI of mVRF (Table 3, Figure 12) were smaller than the hVRF intervening 
sequences. The complete genomic sequence was compiled from the 5' UTR of 

10 mVRF through to intron VI, the largest intervening region (2.2 kb), by sequencing 
amplified introns and cloned genomic portions of mVRF. There was only one major 
difference in genomic structure between mVRF and hVRF and that was the exon 
7/intron VI boundary of mVRF was located lObp further downstream in relation to 
the cDNA sequence, hence exon 7 in mVRF is lObp longer than the corresponding 

15 exon in hVRF. 

Exons 6 and 7 are contiguous in mVRF, as has been found to occur in the human 
homologue. The strong sequence homology between exon 6 of mVRF and hVRF 
(Figure 1 0) suggests that this sequence is not a retained intronic sequence but rather 
20 encodes a functional part of the VRF 186 isoform. 

General intron/exon structure is conserved between the various members of the 
VEGF gene family (VEGF, PIGF, hVRF) and therefore it is not surprising that the 
overall genomic organisation of the mVRF gene is very similar to these genes 
25 (Figure 12). 

Previous comparative mapping studies have shown that the region surrounding the 
human multiple endocrine neoplasia type 1 disease locus on chromosome llql3 is 
syntenic with the proximal segment of mouse chromosome 19 (Rochelle et al, 1992). 
30 Since the inventors have mapped the hVRF gene to within Ikb of the human MEN! 
locus (see above) it is most likely that the murine VRF gene maps near the 
centromere of chromosome 19. 
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Expression studies of mVRF 

Northern analysis of RNA from adult mouse tissues (muscle, heart, lung and liver) 
showed that expression appears to be ubiquitous and occurs primarily as a major 
band of approximately 1.3kb in size (Figure 14). This is somewhat different to the 
5 pattern observed for hVRF in which two major bands of 2.0 and 5.5 kb have been 
identified in all tissues examined. The 1.3 kb murine message presumably 
corresponds to the shorter of the human transcripts and the size variation thereof is 
most likely due to a difference in the length of the respective 5' UTRs. 

EXAMPLE 6 

EXPRESSION OF MURINE VEGF IN PRE- AND POST-NATAL MOUSE 
Animals 

Timed pregnant (n=4) and young adult (n=2) mice (C57 inbred strain, ALAB, 
Sweden) were sacrificed with carbon dioxide, and the relevant tissues were taken out 
and frozen on a chuck. Tissues were kept at -70°C until further use. Two 
gestational ages was used in this study; embryonic day 8 (E8), 14 and £17. 

In situ hybridisation histochemistry 

In situ hybridisation was performed as previously described (Dagerlind et al 9 1992). 
20 Briefly, transverse sections (14jim) were cut in a cryostat (Microm, Germany), 
thawed onto Probe-On slides (Fisher Scientific, USA) and stored in black sealed 
boxes at -70°C until used. The sequences of the synthetic 42-mer oligonucleotides 
complementary to mRNA encoding mVRF were 

ACCACCACCTCCCTGGGCTGGCATGTGGCACGTGCATAAACG [SEQ ID 

25 NO:ll] (complementary to nt 120-161) and 

AGTTGTTTGACCACATTGCCCATGAGTTCCATGCTCAGAGGC [SEQ ID 
NO: 12] (complementary to nt 162-203). To detect the two alternative splice forms 
oligonucleotide GATCCTGGGGCTGGAGTGGGATGGATGATGTCAGCTGG [SEQ 
ID NO: 13] (complementary to nt xxx-xxx) and 

30 GCGGGCAGAGGATCCTGGGGCTGTCTGGCCTCACAGCACT [SEQ ID NO: 14] 
were used. The probes were labeled at the 3 '-end with deoxyadenosine- 
alpha[thio]triphosphate [ 35 S] (NEN, USA) using terminal deoxynucleotidyl 



10 



15 
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transferase (IBI, USA) to a specific activity of 7-10 x 10 8 cpm/jig and hybridised to 
the sections without pretreatment for 16-18 h at 42°C. The hybridisation mixture 
contained: 50% v/v formamide, 4 x SSC (1 x SSC = 0.15M NaCl and 0.0 15M 
sodium-citrate), 1 x Denhardt's solution (0.02% each of polyvinylpyrrolidone, BSA 
5 and Ficoll), 1% v/v sarcosyl (N-lauroylsarcosine; Sigma), 0.02M phosphate buffer 
(pH 7.0), 10% w/v dextran sulfate (Pharmacia, Sweden), 250|ig/ml yeast tRNA 
(Sigma), 500jag/ml sheared and heat denatured salmon sperm DNA (Sigma) and 
200mM dithiothreitol (DTT; LKB, Sweden). In control sections, the specificity of 
both probes was checked by adding a 20-fold excess of unlabeled probe to the 

10 hybridisation mixture. In addition, adjacent sections were hybridised with a probe 
unrelated to this study which gave a different expression pattern. Following 
hybridisation the sections were washed several times in 1 x SSC at 55°C, dehydrated 
in ethanol and dipped in NTB2 nuclear track emulsion (Kodak, USA). After 3-5 
weeks the sections were development in D-19 developer (Kodak, USA) and cover- 

15 slipped. In some cases, sections were opposed to an autoradiographic film (Beta-max 
autoradiography film Amersham Ltd, UK) prior to emulsion-dipping. 

The four different probes gave identical hybridisation patterns in all tissues examined. 
Mouse VRF expression was detecting already in the E8 embryo, in which positive 

20 signal was recorded over structures most likely corresponding to the neuronal tube. 
In sagittal sections of El 4 mouse embryo the strongest hybridisation signal was 
present over heart and in the nervous system, especially cerebral cortex (Figure 14A). 
A low level of expression was present in all other tissues. At a later gestational age, 
El 7, a high mVRF mRNA signal was confined to he heart and brown fat tissue in 

25 the back and around the neck (Figure 14B). Clearly positive hybridisation signals 
were present in the gray of the spinal cord and in the tongue (Figure 14B). 
Expression in the cerebral cortex was clearly reduced compared to day 14. The weak 
background expression seen in the E14 embryo in for example muscle, had decreased 
at this gestational age. A strong mVRF mRNA hybridisation signal was present 

30 solely over the heart and in the brown fat in the young adult mice (Figure 14C). The 
signal over the heart was evenly distributed ove the entire ventricular wall, including 
the papillary muscles (Figure 14D). In sections of heart tissue hybridised with an 
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excess of cold probe, no specific labeling over background signal was recorded 
(Figure 14E). 

Apart from the heart, mVRF mRNA signal was present over certain tissues on the 
5 outside of the thoracic cage that morphologically resembled brown fat. This was 
verified with sudan black counterstaining, which showed a strong staining in the 
same areas (Figure 15A and 15B). In transverse sections of adult mouse spinal cord, 
the mVRF probes gave a neuronal staining pattern over the gray matter (Figure 15C). 
Counterstaining with toluidine (Figure 15D) showed that motoneurons in the ventral 
10 horn (Figure 15C and 15D), interneurons (Figure 15C) in the deep part of the dorsal 
hom and around the central canal where to a large extent positive for mVRF mRNA. 

EXAMPLE 7 

EFFECTS OF VEGF AND SOM175 PROTEINS ON CHICK 
15 SENSORY NEURONS 

The effects of VEGF and SOM175 proteins on embryonic day 8 chick sensory 
neurons were determined using the method of Nurcombe et al (1992). The neuronal 
assay was read at 48 hours using 2000 cells per assay well. The results were 
obtained using 3 H-thymidine counts. The percentage survival of neurons, neurite 
20 outgrowth and average neurite length in \xm were determined using NGF as positive 
control and various concentrations of VEGF, VEGF in the presence of heparin and 
VEGF in the presence of heparin and 5 fiM, 5'-flurouracil (5FU). 5FU kills glial 
cells. 

25 The results are shown in Figure 16. The results show that VEGF is effective in 

promoting neuronal survival but that this requires the presence of glial cells. Figure 
17 shows the results of the effect of VEGF and SOM175 on three types of chick 
glia. The glia tested were CNS glia, peripheral glia and CNS oligodendrocytes. 
Heparin was used as 10 jag/ml in all cultures and the assay was read at 24 hours. 

30 Results were measured in 3 H-thymidine counts using 2000 cells per well. 
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The results show that for chick central and peripheral neurons, astroglia were 
markedly stimulated to proliferate by SOM175 in the presence of heparin but that 
chick oligodendrocytes showed negligible increase in the rate of division. 

5 

EXAMPLE 8 

EFFECTS OF SOM175 PROTEINS ON MOUSE PRIMARY AND 

CENTRAL NEURONS 

The results in Example 7 show that VEGF isoform had an effect on chick primary 
10 and central neurons through the agency of the astroglial cells. Similar experiments 
were repeated in mouse cells. 

Culture conditions 

Neuronal and gligal cells for all in vitro experiments were prepared and cultured 
15 according o the techniques described in "Methods in Neurosciences (Vol. 2): Cell 
Culture" Ed. P.M. Conn, Academic Press, San Diego, 1990, pp33-46 for astroglial 
cells, pp56-74 for oligodendroglial cells, and pp87-102 for central neurons. 

Cells were plated onto 24-well culture clusters (Nunc) coated with poly-L-ornithine 
20 (0.1 mg/ml, lh) at a density of 2,000 cells/well. After 48 hours in culture, neurons 
were counted in the wells under inverted phase light using well established 
techniques (Maruta et al. 1993) and glial cells assessed with [ 3 H]thymidine uptake to 
monitor cell division rates as below. Heparin (lOjig/ml, low molecular weight 
fraction, Sigma Chemical Corp.) was present at all times in the culture media except 
25 where noted. The neuronal cultures were supplemented with 5mM 5-fluoro-2- 
deoxyuridine (Sigma) to suppress background glial growth. 

3 H-Thymidine incorporation assay for glial cell proliferation 

The cells were pulsed for 14h with 3 H-thymidine (specific activity 103 (iCi/ug) fraom 
30 a stock concentration of 0.1 mCi/ml in standard medium, giving a final incubating 
volume of 20 jal/well. The contents of the wells were harvested and absorbed onto 
nitrocellulose paper (Titertek, Flow). Remaining adherent cells were removed by 
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incubation with trypsin/versene (CSL Limited, Victoria, Australia) for 5 min. This 
procedure was carried out twice. The nitrocellulose discs were washed in a standard 
Titertek harvester (Flow) using first distilled water, and then methanol. The 
nitrocellulose discs were dried, scintillation fluid (containing 5% v/v Triton-X) added 
5 and the discs counted on a scintillation counter. 

Greatest activity was seen with preparations of SOM175 absent exon 6 (SOMaX6) 
on mouse astroglial cell cultures, where there was a significant stimulus to their 
proliferation when delivered in conjunction with heparin (Figure 16). Little stimulus 
10 was given to the proliferation of oligodendroglial cells (Figure 17), and very little 
discemable potentiation of the survival response of isolated forebrain neurons (Figure 
18). The standard deviation on all three graphs for each point was less than 8%. 

The viability of neurons can be maintained by promoting glial cell proliferation. 
15 Furthermore, SOMaX6 is a good inducer of astroglial proliferation and may be 

expressed in conjunction with the formation of astroglial endfeet on central nervous 
system endothelial cells. 

Those skilled in the art will appreciate that the invention described herein is 
20 susceptible to variations and modifications other than those specifically described. It 
is to be understood that the invention includes all such variations and modifications. 
The invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and 
all combinations of any two or more of said steps or features. 



25 
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TABLE 3 

Splice junctions of the murine VRF gene 

5 5' UTR* Exon 1 >223bp CCCAGgtacgtgcgt Intron I 495bp 

ttccccacagGCCCC Exon 2 43bp GAAAGgtaataatag Intron II 288bp 

ctgcccacagTGGTG Exon 3 197bp TGCAGgtaccagggc Intron III 196bp 

ctgagcacagATCCT Exon 4 74bp TGCAGgtgccagccc Intron IV 182bp 

ctcttttcagACCTA Exon 5 36bp GACAGattcttggtg Intron V 191 bp 

10 ctcctcctagGGTTG Exon 6 lOlbp (no intron) 

CCCACTCCAGCCCCA Exon 7 135bp TGTAGgtaaggagtc Intron VI ~2200bp 

cactccccagGTGCC Exon 8 394bp AGAGATGGAGACACT 



Uppercase and lowercase letters denote exonic and intronic sequences respectively. 
15 * Indicates that the 5' end of exon 1 has not yet been determined. 
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(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
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(B) FILING DATE: 20-NOV-1995 
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(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES DR, E JOHN L 
(C) REFERENCE/DOCKET NUMBER: EJH/EK 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17... 589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC 4 9 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser 
15 10 

CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA 9 7 

Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala 
15 20 25 

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 14 5 

Pro Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe 
30 35 40 

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 19 3 

Met Asp Val Tyr Gin Arg Ser Tyr Cys His Pro lie Glu Thr Leu Val 
45 50 55 

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 241 
Asp lie Phe Gin Glu Tyr Pro Asp Glu lie Glu Tyr lie Phe Lys Pro 
60 65 70 75 



TCC TGT 
Ser Cys 



GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 
Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly 
80 85 90 
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CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 33 7 

Leu Glu Cys Val Pro Thr Glu Glu Ser Asn He Thr Met Gin He Met 
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 385 
Arg He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser Phe Leu 
110 115 120 

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GGA AGA CAA 433 
Gin His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin 
125 130 135 

GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA 481 
Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val 
140 145 150 155 

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT 529 
Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg 
160 165 170 

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 577 
Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
175 180 185 

AAG CCG AGG CGG TGAG CCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG 62 9 
Lys Pro Arg Arg 
190 

GAACCAGATC TCTCACCAGG 64 9 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 
15 10 15 

Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Met Ala Glu Gly 
20 25 30 

Gly Gly Gin Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gin 
35 40 45 

Arg Ser Tyr Cys His Pro He Glu Thr Leu Val Asp He Phe Gin Glu 
50 55 60 
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Tyr Pro Asp Glu 
65 

Met Arg Cys Gly 



Thr Glu Glu Ser 
100 

Gin Gly Gin His 
115 

Glu Cys Arg Pro 
130 

Pro Cys Ser Glu 
145 

Cys Lys Cys Ser 



Leu Glu Leu Asn 
180 



lie Glu Tyr lie 
70 

Gly Cys Cys Asn 
85 

Asn lie Thr Met 



lie Gly Glu Met 
120 

Lys Lys Asp Arg 
135 

Arg Arg Lys His 
150 

Cys Lys Asn Thr 
165 

Glu Arg Thr Cys 



Phe Lys Pro Ser 
75 

Asp Glu Gly Leu 
90 

Gin lie Met Arg 
105 

Ser Phe Leu Gin 



Ala Arg Gin Glu 
140 

Leu Phe Val Gin 
155 

Asp Ser Arg Cys 
170 

Arg Cys Asp Lys 
185 



Cys Val Pro Leu 
80 

Glu Cys Val Pro 
95 

lie Lys Pro His 
110 

His Asn Lys Cys 
125 

Asn Pro Cys Gly 



Asp Pro Gin Thr 
160 

Lys Ala Arg Gin 
175 

Pro Arg Arg 
190 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 3.. 624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 



CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 
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CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 28 7 

Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 33 5 

Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 4 79 

Arg Pro Gin Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 52 7 

Pro Ser Pro Ala Asp lie Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 6 84 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GCCTCTCAGA GGG CTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 864 

GAG TTGG AAG AGGAGACTGG G AGG CAG CAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 



CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 



1094 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His Arg 
130 135 140 

Pro Gin Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala Pro 
145 150 155 160 

Ser Pro Ala Asp lie Thr His Pro Thr Pro Ala Pro Gly Pro Ser Ala 
165 170 175 

His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala Ala 
180 185 190 



Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 
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(2) INFORMATION r'OR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 566 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 14 3 

Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 28 7 

Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 33 5 

Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 3 83 

Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 



AAA AAG 
Lys Lys 



GAC AGT GCT GTG AAG CCA GAT AGC CCC AGG CCC CTC TGC CCA 
Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro 
130 135 140 



431 
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CGC TGC ACC CAG CAC CAC CAG CGC CCT GAC CCC CGG ACC TGC CGC TGC 4 79 
Arg Cys Thr Gin His His Gin Arg Pro Asp Pro Arg Thr Cys Arg Cys 
145 150 155 

CGC TGC CGA CGC CGC AGC TTC CTC CGT TGC CAA GGG CGG GGC TTA GAG 527 
Arg Cys Arg Arg Arg Ser Phe Leu Arg Cys Gin Gly Arg Gly Leu Glu 
160 165 170 175 

CTC AAC CCA GAC ACC TGC AGG TGC CGG AAG CTG CGA AGG TGACACATGG 576 
Leu Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 

CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG GGAACAAAGG 63 6 

GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGAC CTCAGC CCAGGCAGAA GCTGCTCTAG 6 96 

GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC CATCATCAAA 756 

CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA CCAGCTCAGG 816 

GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGT AAG CAT CTTACAACTG 876 

GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT GGGCTTTGGT 93 6 

ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA AAAAAAA 993 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Ser Pro Leu Leu Arg Arg Leu 
1 5 

Ala Pro Ala Gin Ala Pro Val Ser 
20 

Arg Lys Val Val Ser Trp lie Asp 
35 40 

Pro Arg Glu Val Val Val Pro Leu 
50 55 

Ala Lys Gin Leu Val Pro Ser Cys 
65 70 



Leu Leu Ala Ala Leu Leu Gin Leu 
10 15 

Gin Pro Asp Ala Pro Gly His Gin 
25 30 

Val Tyr Thr Arg Ala Thr Cys Gin 
45 

Thr Val Glu Leu Met Gly Thr Val 
60 

Val Thr Val Gin Arg Cys Gly Gly 
75 80 
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Cys Cys Pro Asp 



Val Arg Met Gin 
100 

Glu Met Ser Leu 
115 

Lys Asp Ser Ala 
130 

Cys Thr Gin His 
145 

Cys Arg Arg Arg 



Asn Pro Asp Thr 
180 



Asp Gly Leu Glu 
85 

lie Leu Met lie 



Glu Glu His Ser 
120 

Val Lys Pro Asp 
135 

His Gin Arg Pro 
150 

Ser Phe Leu Arg 
165 

Cys Arg Cys Arg 



Cys Val Pro Thr 
90 

Arg Tyr Pro Ser 
105 

Gin Cys Glu Cys 



Ser Pro Arg Pro 
140 

Asp Pro Arg Thr 
155 

Cys Gin Gly Arg 
170 

Lys Leu Arg Arg 
185 



Gly Gin His Gin 
95 

Ser Gin Leu Gly 
110 

Arg Pro Lys Lys 
125 

Leu Cys Pro Arg 



Cys Arg Cys Arg 
160 

Gly Leu Glu Leu 
175 



( 2 ) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 14 3 

Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40. 45 



CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 



191 
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GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Ar y Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 33 5 

Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGG TGC CGG AAG CTG CGA AGG 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Cys Arg Lys Leu Arg Arg 
130 135 140 

TGACACATGG CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG 4 91 

GGAACAAAGG GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGAC CTCAGC CCAGGCAGAA 551 

GCTGCTCTAG GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC 611 

CATCATCAAA CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA 671 

CCAG CTCAGG GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT 731 

CTTACAACTG GCTCTTCCTC CCCTCACTAA GAAGAC CCAA ACCTCTGCAT AATGGGATTT 791 

GGGCTTTGGT ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA 851 



AAAAAAA 



858 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Ser Pro Leu 
1 

Ala Pro Ala Gin 
20 

Arg Lys Val Val 
35 

Pro Arg Glu Val 
50 

Ala Lys Gin Leu 
65 

Cys Cys Pro Asp 



Val Arg Met Gin 
100 

Glu Met Ser Leu 
115 

Lys Asp Ser Ala 
130 



Leu Arg Arg Leu 
5 

Ala Pro Val Ser 



Ser Trp lie Asp 
40 

Val Val Pro Leu 
55 

Val Pro Ser Cys 
70 

Asp Gly Leu Glu 
85 

lie Leu Met lie 



Glu Glu His Ser 
120 

Val Lys Pro Asp 
135 



Leu Leu Ala Ala 
10 

Gin Pro Asp Ala 
25 

Val Tyr Thr Arg 



Thr Val Glu Leu 
60 

Val Thr Val Gin 
75 

Cys Val Pro Thr 
90 

Arg Tyr Pro Ser 
105 

Gin Cys Glu Cys 



Arg Cys Arg Lys 
140 



Leu Leu Gin Leu 
15 

Pro Gly His Gin 
30 

Ala Thr Cys Gin 
45 

Met Gly Thr Val 



Arg Cys Gly Gly 
80 

Gly Gin His Gin 
95 

Ser Gin Leu Gly 
110 

Arg Pro Lys Lys 
125 

Leu Arg Arg 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 14 3 

Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 2 87 

Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ACC TAAAAAAAAG GACAGTGCTG TGAAGC CAGA 335 
Gin Val Arg Met Gin Thr 
100 

CAGGGCTGCC ACTCCCCACC ACCGTCCCCA GCCCCGTTCT GTTCCGGGCT GGGACTCTGC 3 95 

CCCCGGAGCA CCCTCCCCAG CTGACATCAC CCATCCCACT CCAGCCCCAG GCCCCTCTGC 455 

CCACGCTGCA CCCAGCACCA CCAGCGCCCT GACCCCCGGA CCTGCCGCTG CCGCTGCCGA 515 

CGCCGCAGCT TCCTCCGTTG CCAAGGGCGG GGCTTAGAGC TCAACCCAGA CACCTGCAGG 575 

TGCCGGAAGC TGCGAAGGTG ACACATGGCT TTTCAGACTC AGCAGGGTGA CTTG CCTCAG 635 



AGGCTATATC CCAGTGGGGA ACAAAGAGGA GCCTGGTAAA AAACAGCCAA GCCCCCAAGA 



695 
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CCTCAGCCCA GGCAGAAGCT GCTCTAGGAC CTGGGCCTCT CAGAGGGCTC TTCTGCCATC 755 
CCTTGTCTCC CTGAGGCCAT CATCAAACAG GACAGAGTTG GAAGAGGAGA CTGGGAGGCA 815 
GCAAGAGGGG TCACATACCA GCTCAGGGGA GAATGGAGTA CTGTCTCAGT TTCTAACCAC 875 
TCTGTGCAAG TAAGCATCTT ACAACTGGCT CTTCC 910 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin Thr 
100 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ACCACCACCT CCCTGGGCTG GCATGTGGCA CGTGCATAAA CG 42 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



AGTTGTTTGA CCACATTGCC CATGAGTTCC ATGCTCAGAG GC 4 2 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GATCCTGGGG CTGGAGTGGG ATGGATGATG TCAGCTGG 



38 
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(2) INFORMATION rUR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



GCGGGCAGAG GATC CTGGGG CTGTCTGGCC TCACAGCACT 
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CLAIMS: 

1. A biologically isolated proteinaceous molecule having the following 
characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF). 

2. A proteinaceous molecule according to claim 1 wherein the molecule exhibits at 
least one of the following properties: 

(i) an ability to induce vascular endothelial cells; 

(ii) an ability to interact with flt-\ljlk-\ family of receptors; and/or 

(iii) an ability to induce cell migration, cell survival and/or an increase in 
intracellular lavels of alkaline phosphatase. 

3. A proteinaceous molecule according to claim 1 or 2 wherein said molecule has 
the capacity to induce astroglial proliferation. 

4. A proteinaceous molecule according to claim 1 wherein said molecule is of 
human origin. 

5. A proteinaceous molecule according to claim 1 wherein said molecule is of non- 
human origin. 

6. A proteinaceous molecule according to claim 5 wherein said molecule is of 
livestock animal, companion animal, laboratory test animal, avian, fish or reptilian 
origin. 

7. A proteinaceous molecule according to claim 5 wherein said molecule is encoded 
by a gene located at chromosome 1 lql3. 
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8. A proteinaceous molecule according to claim 1 wherein the percentage similarity 
to SEQ ID NO:2 is at least about 30%. 

9. A proteinaceous molecule according to claim 1 wherein the percentage similarity 
to SEQ ID NO:2 is at least about 40%. 

10. A proteinaceous molecule according to claim 1 wherein the percentage similarity 
to SEQ ID NO:2 is at least about 60-70%. 

11. A proteinaceous molecule according to claim 1 comprising a sequence of amino 
acids as set forth in SEQ ID NO:4 or a part, fragment, derivative or analogue thereof. 

12. A proteinaceous molecule according to claim 1 comprising an amino acid 
sequence substantially set forth in SEQ ID NO:6 or a part, fragment, derivative or 
analogue thereof. 

13. A proteinaceous molecule according to claim 1 comprising an amino acid 
sequence substantially set forth in SEQ ID NO: 8 or a part, fragment, derivative or 
analogue thereof. 

14. A proteinaceous molecule according to claim 1 comprising an amino acid 
sequence substantially set forth in SEQ ID NO: 10 or a part, fragment, derivative or 
analogue thereof. 

15. A recombinant molecule having the following characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:4 
or having at least about 15% similarity to but at least about 5% 
dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 
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16. A recombinant molecule having the following characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO: 6 
or having at least about 15% similarity to but at least about 5% 
dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

17. A recombinant molecule having the following characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO: 8 
or having at least about 15% similarity to but at least about 5% 
dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

18. A recombinant molecule having the following characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID 
NO: 10 or having at least about 15% similarity to but at least 
about 5% dissimilarity to the amino acid sequence set forth in 
SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

19. A recombinant molecule according to claim 15 or 16 or 17 or 18 having at least 
one of the following properties: 

(a) an ability to induce vascular endothelial cells; 

(b) an ability to interact with fltl/jlld family of receptors; 

(c) an ability to induce cell migration, cell survival and/or increase 
intracellular levels of alkaline phosphatase. 

20. A recombinant molecule according to claim 15 or 16 or 17 or 18 having the 
capacity to induce astroglial proliferation. 
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21. A recombinant molecule according to claim 20 wherein the molecule comprises 
an amino acid sequence substantially as set forth in SEQ ID NO:6. 

22. A peptide fragment corresponding to a portion of the amino acid sequence set 
forth in SEQ ID NO:4 or a derivative or chemical equivalent thereof. 

23. A peptide fragment according to claim 22 having the sequence set forth in SEQ 
ID NO: 6 or a chemical equivalent thereof. 

24. A peptide fragment according to claim 22 having the sequence set forth in SEQ 
ID NO: 8 or a chemical equivalent thereof. 

25. A peptide fragment according to claim 22 having the sequence set forth in SEQ 
ID NO: 10 or a chemical equivalent thereof. 

26. A nucleic acid molecule comprising a sequence of nucleotides or complementary 
to a sequence encoding a proteinaceous molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF). 

27. A nucleic acid molecule according to claim 26 wherein the proteinaceous 
molecule exhibits at least one of the following properties: 

(i) an ability to induce vascular endothelial cells; 

(ii) an ability to interact with fltAlflkA family of receptors; and/or 

(iii) an ability to induce cell migration, cell survival and/or an increase in 
intracellular lavels of alkaline phosphatase. 

28. A nucleic acid molecule according to claim 27 wherein the proteinaceous 
molecule has the capacity to induce astroglial proliferation. 
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29. A nucleic acid molecule according to claim 28 wherein said molecule encodes 
an amino acid sequence substantially as set forth in SEQ ID NO:6. 

30. A nucleic acid molecule according to claim 1 wherein said molecule is of human 
origin. 

31. A nucleic acid molecule according to claim 1 wherein the percentage similarity 
to SEQ ID NO:2 is at least about 30%. 

32. A nucleic acid molecule according to claim 26 comprising a nucleotide sequence 
substantially as set forth in SEQ ID NO:3 or having at least 15% similarity thereto or 
capable of hybridising under low stringency conditions to a reverse complement of the 
nucleotide sequence as set forth in SEQ ID NO:3 provided that the nucleotide sequence 
has at least 15% similarity but at least 30% dissimilarity to the nucleotide sequence set 
forth in SEQ ID NO:3. 

33. A nucleic acid molecule according to claim 26 encoding a murine homologue of 
human VEGF and comprising a nucleotide sequence substantially as set forth in Figure 
9. 

34. A pharmaceutical composition comprising a proteinaceous molecule according 
to claim 1 or 2 or 3 or 1 1 and one or more pharmaceutical^ acceptable carriers and/or 
diluents. 

35. A method for preparing a recombinant molecule having the following 
characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 

said method comprising expressing a nucleic acid molecule encoding said recombinant 
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molecule by a suitable host grown under conditions effective to synthesise said 
recombinant molecule and then isolating said molecule. 

36. A method according to claim 35 wherein the nucleic acid molecule comprises a 
sequence of nucleotides as set forth in SEQ ID NO:3 or having at least 15% similarity 
thereto or is capable of hybridising under low stringency conditions to a reverse 
complement of the nucleotide sequence as set forth in SEQ ID NO:3 provided that the 
nucleotide sequence has at least 15% similarity but at least 30% dissimilarity to the 
nucleotide sequence set forth in SEQ ID NO:3. 

37. A method of inducing astroglial proliferation in a mammal, said method 
comprising administering to said mammal an effective amount of a recombinant 
proteinaceous molecule having the characteristics: 

(i) comprises an amino acid sequence having at least about 1 5% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 

said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

38. A method according to claim 37 wherein the recombinant proteinaceous molecule 
comprises an amino acid sequence substantially as set forth in SEQ ID NO:3 or is a 
derivative thereof. 

39. A method according to claim 37 wherein the recombinant proteinaceous molecule 
comprises an amino acid sequence substantially as set forth in SEQ ID NO: 6 or is a 
derivative thereof. 

40. A method of promoting neuronal survival and/or proliferation in a mammal, said 
method comprising administering to said mammal an effective amount of a recombinant 
proteinaceous molecule having the characteristics: 
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(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 

said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

41. A method according to claim 40 wherein the recombinant proteinaceous molecule 
comprises an amino acid sequence substantially as set forth in SEQ ID NO:3 or is a 
derivative thereof. 



41 . A method according to claim 40 wherein the recombinant proteinaceous molecule 
comprises an amino acid sequence substantially as set forth in SEQ ID NO:6 or is a 
derivative thereof 
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Fig.Kiii) 


Fig.Kiv) 
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1 TCGGCCTCC GAAACC ATG AAC TTT CTG 

Met Asn Phe Leu 
1 

5 0 CTT GCC TTG CTG CTC TAC CTC CAC 

Leu Ala Leu Leu Leu Tyr Leu His 

15 

98 CCC ATG GCA GAA GGA GGA GGG CAG 

Pro Met Ala Glu Gly Gly Gly Gin 
30 35 

146 ATG GAT GTC TAT CAG CGC AGC TAC 
Met Asp Val Tyr Gin Arg Ser Tyr 
45 50 

194 GAC ATC TTC CAG GAG TAC CCT GAT 
Asp lie Phe Gin Glu Tyr Pro Asp 
60 65 

242 TCC TGT GTG CCC CTG ATG CGA TGC 
Ser Cys Val Pro Leu Met Arg Cys 

80 

2 90 CTC GAG TGT GTG CCC ACT GAG GAG 
Leu Glu Cys Val Pro Thr Glu Glu 

95 

33 8 CGG ATC AAA CCT CAC CAA GGC CAG 
Arg Ily Lys Pro His Gin Gly Gin 
110 115 



Fig. Hi) 
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CTG TCT TGG GTG CAT TGG AGC 49 
Leu Ser Trp Val His Trp Ser 
5 10 

CAT GCC AAG TGG TCC CAG GCT GCA 97 
His Ala Lys Trp Ser Gin Ala Ala 
20 25 

AAT CAT CAC GAA GTG GTG AAG TTC 145 
Asn His His Glu Val Val Lys Phe 

40 

TGC CAT CCA ATC GAG ACC CTG GTG 193 
Cys His Pro lie Glu Thr Leu Val 

55 

GAG ATC GAG TAC ATC TTC AAG CCA 241 
Glu lie Glu Tyr lie Phe Lys Pro 
70 75 

GGG GGC TGC TGC AAT GAC GAG GGC 2 89 

Gly Gly Cys Cys Asn Asp Glu Gly 
85 90 

TCC AAC ATC ACC ATG CAG ATT ATG 337 
Ser Asn lie Thr Met Gin lie Met 
100 105 

CAC ATA GGA GAG ATG AGC TTC CTA 3 85 

His lie Gly Glu Met Ser Phe Leu 

120 



Fig. 1(H) 
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3 86 CAG CAC AAC AAA TGT GAA TGC AGA 
Gin His Asn Lys Cys Glu Cys Arg 
125 130 

434 GAA AAT CCC TGT GGG CCT TGC TCA 
Glu Asn Pro Cys Gly Pro Cys Ser 
140 145 

482 CAA GAT CCG CAG ACG TGT AAA TGT 
Gin Asp Pro Gin Thr Cys Lys Cys 

160 

53 0 TGC AAG GCG AGG CAG CTT GAG TTA 
Cys Lys Ala Arg Gin Leu Glu Leu 

175 

57 8 AAG CCG AGG CGG TGAGCCGGGC AGGAG 
Lys Pro Arg Arg 
190 

63 0 GAACCAGATC TCTCACCAGG 



Fig. Kiii) 
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CCA AAG AAA GAT AGA GCA AGA CAA 
Pro Lys Lys Asp Arg Ala Arg Gin 

135 



433 



GAG CGG AGA AAG CAT TTG TTT GTA 
Glu Arg Arg Lys His Leu Phe Val 
150 155 

TCC TGC AAA AAC ACA GAC TCG CGT 
Ser Cys Lys Asn Thr Asp Ser Arg 
165 170 



481 



529 



AAC GAA CGT ACT TGC AGA TGT GAC 
Asn Glu Arg Thr Cys Arg Cys Asp 
180 185 



577 



GAAGG AGCCTCCCTC AGCGTTTCGG 



629 



Fig.Kiv) 
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Fig .2(1) 


Fig.2(ii) 
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1 CC ATG AGC CCT CTG CTC CGC CGC 

Met Ser Pro Leu Leu Arg Arg 
1 5 

48 CTG GCC CCC GCC CAG GCC CCT GTC 

Leu Ala Pro Ala Gin Ala Pro Val 

20 

9 6 CAG AGG AAA GTG GTG TCA TGG ATA 

Gin Arg Lys Val Val Ser Trp lie 

35 

144 CAG CCC CGG GAG GTG GTG GTG CCC 
Gin Pro Arg Glu Val Val Val Pro 
50 55 

192 GTG GCC AAA CAG CTG GTG CCC AGC 
Val Ala Lys Gin Leu Val Pro Ser 
65 70 

240 GGC TGC TGC CCT GAC GAT G6C CTG 
Gly Cys Cys Pro Asp Asp Gly Leu 
80 85 

2 88 CAA GTC CGG ATG CAG ATC CTC ATG 
Gin Val Arg Met Gin lie Leu Met 

100 

33 6 GGG GAG ATG TCC CTG GAA GAA CAC 
Gly Glu Met Ser Leu Glu Glu His 

115 



Fig.2(i) 
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CTG CTG CTC GCC GCA CTC CTG CAG 
Leu Leu Leu Ala Ala Leu Leu Gin 
10 15 

TCC CAG CCT GAT GCC CCT GGC CAC 
Ser Gin Pro Asp Ala Pro Gly His 
25 30 

GAT GTG TAT ACT CGC GCT ACC TGC 
Asp Val Tyr Thr Arg Ala Thr Cys 
40 45 

TTG ACT GTG GAG CTC ATG GGC ACC 
Leu Thr Val Glu Leu Met Gly Thr 

60 

TGC GTG ACT GTG CAG CGC TGT GGT 
Cys Val Thr Val Gin Arg Cys Gly 

75 

GAG TGT GTG CCC ACT GGG CAG CAC 
Glu Cys Val Pro Thr Gly Gin His 
90 95 

ATC CGG TAC CCG AGC AGT CAG CTG 
lie Arg Tyr Pro Ser Ser Gin Leu 
105 110 

AGC CAG TGT GAA TGC AGA CCT AAA 
Ser Gin Cys Glu Cys Arg Pro Lys 
120 125 



47 



Fig. 2(H) 
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95 



143 



191 



239 



287 



335 



383 
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3 84 AAA AAG GAC AGT GCT GTG AAG CCA 
Lys Lys Asp Ser Ala Val Lys Pro 
130 135 

43 2 CGT CCC CAG CCC CGT TCT GTT CCG 
Arg Pro Gin Pro Arg Ser Val Pro 
145 150 

48 0 CCC TCC CCA GCT GAC ATC ACC CAT 
Pro Ser Pro Ala Asp lie Thr His 
160 165 

52 8 GCC CAC GCT GCA CCC AGC ACC ACC 
Ala His Ala Ala Pro Ser Thr Thr 

180 

57 6 GCT GCC GCT GCC GAC GCC GCA GCT 
Ala Ala Ala Ala Asp Ala Ala Ala 

195 



Fig. 2 (Hi) 
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GAC AGG GCT GCC ACT CCC CAC CAC 431 
Asp Arg Ala Ala Thr Pro His His 

140 

GGC TGG GAC TCT GCC CCC GGA GCA 47 9 

Gly Trp Asp Ser Ala Pro Gly Ala 

155 

CCC ACT CCA GCC CCA GGC CCC TCT 52 7 

Pro Thr Pro Ala Pro Gly Pro Ser 
170 175 

AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ser Ala Leu Thr Pro Gly Pro Ala 
185 190 

TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ser Ser Val Ala Lys Gly Gly Ala 
200 205 



Fig.2(iv) 
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62 5 AGAGCTCAAC CCAGACACCT GCAGGTGCCG 

685 GACTCAGCAG GGTGACTTGC CTCAGAGGCT 

745 GGTAAAAAAC AGCCAAGCCC CCAAGACCTC 

805 GCCTCTCAGA GGGCTCTTCT GCCATCCCTT 

865 GAGTTGGAAG AGGAGACTGG GAGGCAGCAA 

82 5 GGAGTACTGT CTCAGTTTCT AACCACTCTG 

985 CTCCCCTCAC TAAGAAGACC CAAACCTCTG 

1045 CTGTGACCCC CAACCCTGAT AAAAG AG AT G 



Fig.2(v) 
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GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GTCTCCCTGA GG C CAT CAT C AAACAGGACA 864 

GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

TGCAAGTAAG CATCTTACAA CTGGCTCTTC 9 84 

CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 

GAAGGAAAAA AAAAAAAAAA 1094 

Fig.2(vi) 



SUBSTITUTE SHEET (Rule 26) 



WO 96/27007 



13/52 



PCT/AU96/00094 



U/52 


15/52 


Fig. 3d) 


Fig.3(ii) 



SUBSTITUTE SHEET (RULE 26) 



WO 96/27007 



PCT/AU96/00094 



%/52 



>VEGF_HUMAN VEGF_HUMAN VASCULAR ENDOTHELIAL 

(VASCULAR 215 AA. 
LENGTH = 215 

SCORE = 181 (92.4 BITS), EXPECT = 6.4e-20, 
IDENTITIES = 33/75 (44%), POSITIVES = 48/75 

QUERY: 31 HQRKWSWIDVYTRATCQPREVWPLTVEL 

+++ W +DVY R+ C+P E +V + E 
SBJCT: 3 6 NHHEWKFMDVYQRSYCHPIETLVDIFQEY 

QUERY: 91 PTGQHQVRMQILMIR 105 

PT + + MQI+ 1+ 
SBJCT: 96 PTEESNITMQ IMRIK 110 

SCORE = 76 (38.8 BITS), EXPECT = 0.0011, 
IDENTITIES = 12/19 (63%), POSITIVES = 16/19 

QUERY: 110 QLGEMSLEEHSQCECRPKK 12 8 

++GEMS +H+ CECRPKK 
SBJCT: 116 HIGEMSFLQHNKCECRPKK 134 

SCORE = 72 (36.8 BITS), EXPECT = 0.0046, 
IDENTITIES = 14/21 (66%), POSITIVES = 15/21 

QUERY: 2 02 RCQGRGLELNPDTCRCRKLRR 222 

RC +R LELN TCRC K RR 
SBJCT: 195 RCKARQLELNERTCRCDKPRR 215 

SCORE = 46 (23.5 BITS), EXPECT =47., 
IDENTITIES = 6/10 (60%), POSITIVES = 9/10 



QUERY: 187 DPRTCRCRCR 196 

DP+TC+C C+ 
SBJCT: 181 DPQTCKCSCK 190 
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GROWTH FACTOR PRECURSOR (VEGF) 



P = 6 . 4e-20 
(64%) 

MGTVAKQLVPSCVTVQRCGGCCPDDGLECV 90 

+ PSCV + RCGGCC D+GLECV 

PDEIEYIFKPSCVPLMRCGGCCNDEGLECV 9 5 



POISSON P(2) = 9.1e-12 
(84%) 



POISSON P(3) = 3.6e-18 

(71%) 



POISSON P(4) = 7.3e-10 
(90%) 



Fig. 3d) 
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Fig. 4(H) 
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FigMiii) 


FigA(iv) 
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Gap Weight: 3. 00 Average Match: 1.0 00 

Length Weight: 0.100 Average Mismatch :- 0 . 900 
Quality: 100 . 9 Length: 739 

Ratio: 0.175 Gaps: 30 

Percent Percent 
Similarity: 69 .703 Identity: 69 . 703 

2 8 ATGAGCCCTCTGCTCCGCCGCCTGC 

Mil I Mill I II 

17 ATGAACTTTCTGCT GTCT . . 

68 TGCAGCTGGCCCCCGCCCAGGCCCC 

III III II I Ml Ml 

57 TGCTGCTCTACCTCCACCATGCCAA 

118 C AC C AGAGGA 

Mill 

106 AGAAGGAGGAGGGCAGAATCATCAC 
140 GTGTATACTCGC . GCTACCTGCCAG 

II III Ml MM 111,11 

152 GTCTATCAGCGCAGCTA . CTGCCAT 
194 T. . . .GA CTGTGGAGCTCAT 

I II III III II 

2 01 T C C AGG AGT AC C C TG ATG AG AT C G A 
2 35 CCCAGCTGCGTGACTGTGCAGCGCT 

M Ml III I II III I 

2 39 CCATCCTGTGTGCCCCTGATGCGAT 
2 85 CCTGGAGTGTGTGCCCACTGGGCAG 

I I ! I M I ' I I I I M I I II I | || 

2 8 9 CCTGGAGTGTGTGCCCACTGAGGAG 



Fig. 40) 
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TGCTCGCCGCACT CC 67 

I I III I I 

. . . TGGGTGCATTGGAGCCTTGCCT 5 6 



TGTCTCCCAGCCTGATGCCCCTGGC 117 

MINI III III I 1 1 I 

GTGGTCCCAGGCTGCA . CCCATGGC 105 

. AAGTGGTG . . . . T CAT GG AT AG AT 147 

1 1 II II 1 1 Mill III 

GAAGTGGTGAAGTTCATG . . . .GAT 151 

CCCCGGGAG . . . GTGGTGGTGCCCT 193 

N ill niiii m 

C C AAT CG AGAC C C TGGTGG AC ATCT 2 00 

GGGCACCGTGGCCAAACAGCTGGTG 234 

I MM III I 

GTACATCTT . . . CAA G 238 



GTGGTGGCTGCTGCCCTGACGATGG 2 84 

I II lllllllll MUM II 

GCGGGGGCTGCTGCAATGACGAGGG 2 8 8 

C AC C AAGTC C GGATGC AGAT 32 9 

I I 1 1 MINIM 

TCCAACATCACCATGCAGATTATGC 33 8 

FigA(ii) 
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3 3 0 CCTCATGATCCGGTACC 

1 1 1 1 1 | 

33 9 GGATCAAACCTCA c 

3 69 GTCCCTGGAAGAACACAGCCAGTGT 

I M I I Mil I I III. 

376 GAGCTTCCTACAGCACAACAAATGT 
419 GTGCTGTGAAGCCAGACAGGGCTGC 

I III ill! I 

423 G AGC AAGAC AAG 

469 CGTTCTGTTCCGGGCTGGGACTCTG ' 

I II II III II 

443 ... TGTGGGCCTTGCTCAGA 

519 CATCACCCATCCCACTCCAGCCCCA 

468 

5 69 GC ACCACCAGCGCCC 

II II I 

469 GCATTTGTTTGTACAA 

6 09 TGCCGACGCCGCAGCTTCCTCCGTT 

II II I Ml II MM 

5 09 TG . C AAAAAC AC AG AC T C . . GCGTT 
657 AACCCAGACACCTGCAGGTGCCGGA 

Ml I 

554 AACGAACGTACTTGCAGATGTGACA 

Fig.A-(iii) 
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CGAGCAGTCAGC . . . TGGGGGAGAT 3 68 

I II I MM I II Mill 

CAAG . . GC C AGC AC AT AGGAGAGAT 375 

GAATGCAGACCTAAAAAAAAGGACA 418 



GAATGCAGACC . . . AAAGAAAGATA 422 

CACTCCCCACCACCGTCCCCAGCCC 4 68 

I I I I I 

AAAATCCC 442 

CCCCCGGAGCACCCTCCCCAGCTGA 518 

Mill I 

. . . GCGGAGAA , 467 

GGCCCCTCTGCCCACGCTGCACCCA 56 8 

I 

A 468 

TGACCCCCGGACCTGCCGCTGCCGC 608 

II II I III II ll I 

. GATCCGCAGACGTGTAAATGTTCC 508 

GCCAAGGGCGGGGC . . TTAGAGCTC 656 

II MM III I III I 

GC . . AAGGCGAGGCAGCTTGAGTTA 553 

AGCTGCGAAGGTGA 695 

• AGCCGAGGCGGTGA 592 

FigA(iv) 
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Fig. 5(H) 


Fig.5(iii) 
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Fig.5(iv) 


Fig.5(v) 
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gcacgagctcaggccgtcgctgcggcgctg 
gggggccgcggaggagccgccccctgcgcc 
ggcggctctggctgacccccccccacaccg 



1 6 CGTCGCCTGCTGCTTGTTGCACTGCTGCAG 
RRLLLVALLQ 

7 6 TT TGATGG C C C C AGT C AC C AG AAG AAAGTG 
FDGPSHQKKV 

13 6 ACATGCCAGCCCAGGGAGGTGGTGGTGCCT 
TCQPREVVVP 

19 6 AAACAACTAGTGCCCAGCTGTGTGACTGTG 
KQLVPSCVTV 

2 56 GGCCTGGAATGTGTGCCCACTGGGCAACAC 
GLECVPTGQH 

316 TACCCGAGCAGTCAGCTGGGGGAGATGTCC 
YPSSQLGEMS 

37 6 CCTAAAAAAAAGGAGAGTGCTGTGAGGCCA 
PKKKESAVRP 

43 6 CAGCCCCGCTCTGTTCCGGGCTGGGACTCT 
QPRSVPGWDS 
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cgttgcgctgcctgcgcccagggctcggga 
ccgccccgggtccccgggtccgcgccatgg 
ccgggctagggcccgATGAGCCCCCTGCTG 

M S P L L -17 

CTGGCTCGCACCCAGGCCCCTGTGTCCCAG 
LARTQAPVSQ 4 

GTGCCATGGATAGACGTTTATGCACGTGCC 
VPWIDVYARA 24 

CTGAGCATGGAACTCATGGGCAATGTGGTC 
LSME L MGNVV 44 

CAGCGCTGTGGTGGCTGCTGCCCTGACGAT 
QRCGGCCPDD 64 

CAAGTCCGAATGCAGATCCTCATGATCCAG 
QVRMQILMIQ 84 

CTGGGAGAACACAGCCAATGTGAATGCAGA 
LGEHSQCECR 104 

GACAG GGTTGCCATACCCCACCACCGTCCC 
DRVAI PHHRP 124 

ACCCCGGGAGCACCCTCCCCAGCTGACATC 
TPGAPSPADI 144 
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49 6 ATCCATCCCACTCCAG CCrrA^nATrr-rrnrr 

IHPTPAPGSS 

S P R I L 

55 6 CTGACCCCCGGACCTGCCGTTGCCGCTGTA 
LTPGPAVAAV 
PDPRTCRCRC 

616 GGGGCTTAGAGCTCAACCCAGACACCTGTA 
G A * 

RGLELNPDTC 

676 ctttccagactccacgggcccggctgcttt 

73 6 agcacaggcgtaacctcctcagtctgggag 

796 gagctctctcgccatcttttatctcccaga 

85 6 atgtctcacctcaggggccagggtactctc 

916 ttctggctggctgtctcccctcactatgaa 

97 6 gggttctgttatgataactgtgiacacacac 



103 6 gacactaaaaaaaaaaaaaaaaaaaaaaaa 
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acacac 



aaaaaaaaaaaa 



Fig.9(iv) 



GCCCGCCTTGCACCCAGCGCCGCCAACGCC 

ARLAPSAANA 164 
CPPCTQRRQR 130 

GACGCCGCCGCTTCCTCCATTGCCAAGGGC 
DAAAS S IAKG 184 

RRRRFLHCQG 150 
I 

GGTGCCGGAAGCCGCGAAAGTGAcaagctg 

186 

RCRKPRK* 167 

tatggccctgcttcacagggagaagagtgg 
gtcactgccccaggacctggaccttttaga 
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